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Summary  of  the  Progress 


1.  Objectives 

The  main  objective  of  the  ongoing  research  program  with  AFOSR  is  to  develop 
a  block-structured  solution  scheme  for  Euler  equations  [1-5].  The 
methodology  is  aimed  at  providing  computational  techniques  for  solving 
three-dimensional,  transonic  flow  problems  around  complex  geometries.  Also, 
we  are  interested  in  treating  such  large  problems  using  large  computers  with 
vectorizing  and  parallel  processing  capabilities. 

Our  past  work  has  been  implemented  on  main  frames  such  as  IBM,  Cray  and 
CYBER  computers.  It  was  based  on  storing  a  large  problem  in  terms  of  many 
smaller  regions  (blocks)  on  available  disc  storage.  It  was  assumed  that  the 
main  memory  is  not  large  enough  to  store  the  entire  problem.  In  these 
applications,  portions  of  this  data  was  brought  into  a  main  memory  which  was 
shared  by  one  or  a  few  processors.  Each  processor  operated  on  its  share  of 
data  (a  single  block).  The  instrumentation  was  requested  to  implement  the 
same  scheme  on  a  computer  with  distributed  memory  and  many  processors.  In 
this  application  the  problem  is  again  divided  into  many  blocks.  However, 
each  block  is  stored  in  the  distributed  memory  of  an  individual  processor 
and  computed  seperately.  Only  the  boundary  conditions  between  the 
neighboring  blocks  are  communicated  between  the  blocks. 

2.  Summary  of  the  Development  Effort 

The  hardware  (Intel  IPSC  hypercube)  was  purchased  and  installed.  It 


was  linked  to  the  other  computers  for  data  transfer.  A  version  of  our  Euler 
solver  was  developed  to  run  on  a  distributed  memory  machine.  Up  to  date. 


two  basic  test  cases  were  computed  [5],  although  the  activity  is  continuing 
at  a  part  of  our  AFOSR  research  program. 

The  flow  field  around  a  NACA0012  profile  at  =  0.80  was  solved  by 

dividing  the  flow  field  into:  4-blocks  with  4000  grid  points,  8-blocks  with 
8000  grid  points  and  16-blocks  with  16000  grid  points.  It  was  demonstrated 
that  by  using  4,  8  and  16  processors  for  each  of  the  above  problems  one 
can  obtain  the  same  computation  time. 

The  second  example  was  to  model  a  16-block  represtentation  of  the  flow 
around  a  wing-body  such  that  16  processors  are  combined  to  each  other 
similar  to  the  way  blocks  are  connected.  This  set-up  provides  an  efficient 
way  of  representing  a  problem  with  a  complex  block  structure  with  minimum 
data  transfer.  In  this  case  the  data  transfer  again  occurs  between  the 
neighboring  blocks. 

The  work  is  ongoing  to  demonstrate: 

the  need  for  increasing  the  number  of  processors  for  solving 
large  problems  as  well  as  their  speeds. 

the  need  for  using  many  loosely  coupled  computers  for  solving 
complex  geometric  problems  so  that  all  computer  resources  (memory, 
CPU,  vectorizing  capabilities  disk  1/0  etc.)  can  be  simultaneously 
uti 1 ized. 

3.  Publications 

Two  papers  were  presented  describing  this  activity: 

1.  Ecer,  A.  and  Spyropoulos,  J.  T.,  Parallel  Processing  Schemes  for 
the  Block-Structured  Solution  of  Transonic  Flows,  1987  Summer 
Computer  Simulation  Conference,  July  1987,  Montreal,  Canada. 


2.  Ecer,  A.  Spyropoulos,  J.  T.  and  M.  E.  Sims,  Parallel  Processing 


Techniques  for  the  Solution  of  Euler  Equations,  AIAA-88-0620,  AIAA 
Aerospace  Summer  Meeting,  Reno,  Nevada,  January  1988. 

Also  two  presentations  were  made  at  two  conferences.  The  written  versions 
of  these  presentations  are  being  prepared. 

1.  Ecer,  A.,  Spyropoulos  J.  T..  and  M.  E.  Sims,  The  solution  of  Euler 
Equations  on  a  Hypercube,  3rd  Conference  on  Hypercube  Computers  and 
Applications,  Pasadena,  California,  January  19  -  12,  1988. 

2.  Ecer,  A.,  Spyropoulos,  J.  T.,  Block-Structured  Solution  of  Euler 
and  Navier-Stokes  Equations.  2nd  International  Symposium  on 
Domain  Decomposition  Methods,  Los  Angeles,  California,  January  14  - 
16,  1988. 


Details  of  the  Performed  Work 
1.  Objectives 

The  objectives  of  the  ongoing  research  were: 

i)  To  develop  a  finited  element  mesh  generation  scheme  that  will 

allow  optimum  grid  configurations  to  be  established  for  different 


flow  zones  [1] , 

ii)  To  develop  a  "zonal"  solution  scheme  for  three-dimensional  tran¬ 
sonic  flows  that  will  allow  independent  grids  to  be  generated  in 
each  flow  zone  and  produce  an  iterative  solution,  coupling  the 
local  solution  of  each  block,  [2,  3]. 

iii)  To  implement  the  "zonal"  solution  scheme  on  computers  with  parallel 
processing  capabilities  [4,  5]. 

This  last  task  was  completed  first  on  IBM  3090,  Cray  XMP  and  CYBER  201 
computers  [4].  The  present  attempt  was  aimed  at  working  with  a  computer 
with  many  processors  with  distributed  memory  and  evaluate  the  reaui rements 
for  computing  large  problems  [5]. 

2.  Solution  Scheme 

The  solution  procedure  involves  a  block-structures  relaxation  scheme 
where  two  basic  computational  tasks  exist: 

.  the  formulation  of  residual  vectors  and 

the  solution  of  a  set  of  implicit  equations. 

In  the  developed  scheme,  the  problem  is  to  define  the  complex  flow 
field  in  terms  of  a  series  of  simpler  blocks.  The  grids  are  generated  for 
each  of  the  blocks  and  stored  seperately  on  the  memory  of  individual 
processors.  The  residuals  are  calculated  for  each  block.  If  each  processor 


system.  The  flow  field  can  be  divided  upto  16  blocks  and  each  block  would 
be  analyzed  on  a  single  processor.  The  data  transfer  between  the 
neighboring  blocks  is  in  terms  of  boundary  conditions  (mass  and  entropy 
fluxes  calculated  through  the  iterations).  The  objective  is  to  ensure  that 
the  conservation  of  mass  and  entropy  is  satisfied  between  the  blocks.  Since 
there  is  an  individual  data  channel  between  the  neighboring  blocks  there 
to  provide  such  information,  considerable  improvement  is  obtained.  In 
solving  large  problems,  data  transfer  plays  an  important  role  in  terms  of 
efficiency.  Compared  to  a  single  processor  with  a  single  I/O  channel,  the 
proposed  system  may  provide  16  times  speed  improvement  in  terms  of 
computational  speed  as  well  as  32  times  speed  improvement  in  terms  of  data 
flow.  Also  because  of  our  computational  scheme  the  data  transfer  between 
the  neighboring  blocks  solve  only  a  small  data  set  (boundary  conditions). 

The  availability  of  large  real  memory  for  each  processor  enables 
elimination  of  many  computations  which  are  represented  through  the 
iterations.  The  performance  figures  on  this  machine  as  related  to  main 
frame  computers  will  be  presented  in  coming  papers. 
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